Picture for Juntao Li

Juntao Li

Where Matters More Than What: Decoding-aligned KV Cache Compression via Position-aware Pseudo Queries

Add code
Mar 12, 2026
Viaarxiv icon

LongFlow: Efficient KV Cache Compression for Reasoning M

Add code
Mar 12, 2026
Viaarxiv icon

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Add code
Jan 24, 2026
Viaarxiv icon

MemoryRewardBench: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 24, 2026
Viaarxiv icon

$\texttt{MemoryRewardBench}$: Benchmarking Reward Models for Long-Term Memory Management in Large Language Models

Add code
Jan 17, 2026
Viaarxiv icon

Accelerate Speculative Decoding with Sparse Computation in Verification

Add code
Dec 26, 2025
Viaarxiv icon

Overview of CHIP 2025 Shared Task 2: Discharge Medication Recommendation for Metabolic Diseases Based on Chinese Electronic Health Records

Add code
Nov 09, 2025
Viaarxiv icon

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Add code
Oct 26, 2025
Viaarxiv icon

LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling

Add code
Oct 08, 2025
Viaarxiv icon

BatonVoice: An Operationalist Framework for Enhancing Controllable Speech Synthesis with Linguistic Intelligence from LLMs

Add code
Sep 30, 2025
Viaarxiv icon